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INTRODUCTION TO THE THEORY OF FLUORESCENCE INTENSITY 

DISTRIBUTION ANALYSIS 



1. introduction 

The primary data of a fluorescence correlation experiment is a sequence of photon counts 
detected from a microscopic sample volume. An essential attribute of the fluorescence 
correlation analysis is the calculation of the second order autocorrelation function of photon 
detection. This is a way how a stochastic function (of photon counts) is transformed into a 
statistical function having an expected shape, serving as a means to estimate some parameters 
of the sample. However, the calculation of the autocorrelation function is not the only way for 
extracting information about the sample from the sequence of photon counts. Another 
approach, based on collecting the distribution of the number of photon counts per a given time 
interval, was introduced in fluorescence fluctuation spectroscopy by Qian and Elson in 1990/ 

The idea behind the fluorescence intensity distribution analysis (FIDA) can be well 
understood by imagining an ideal case when a sample volume is uniformly illuminated and 
when there is almost never more than a single particle illuminated at a time, similar to the 
ideal situation in cell sorters. Under these circumstances, each time when a particle enters the 
sample volume, fluorescence intensity jumps to a value corresponding to the brightness of a 
given type of particles. Naturally, the probability that this intensity occurs at an arbitrary time 
moment equals the product of the concentration of a given species and the si2e of the sample 
volume. Another fluorescent species which may be present in the sample solution produces 
intensity jumps to another value characteristic of this other species. In summary, the 
distribution of light intensity is in a straightforward way tetermined by the values of 
concentration and specific brightness" of each fluorescent species in the sample solution. 

In reality, the intensity of fluorescence detected from a panicle within a sample volume is not 
^ uniform but depends on the coordinates of the particle with respect to the focus of the optical 
system. Even though the calculation of a theoretical distribution of the number of photon 



counts is more compl x for a bell-shaped profile than for a rectangular one, the distribution of 
the number of photon counts sensitively depends on values of the concentration and the * 
specific brightness of fluorescent species. The roeaswed distributions of the number of . photon 
counts can therefore be used for sample analysis. 

The first successful realization of this kind of analysis was demonstrated on the basis of 
rnpments of the photon count number distribution."' The k-ih factorial moment of the photon 
count number distribution P(n) is defined as 



In rurn. factorial moments are closely related to factorial cumulants, 
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( C} s are binomial coefficients/ and Kin arc ciimulants.) The basic expreaaion used in moment 
analysis, derived for ideal solutions, relate fc-th order cumulant to concentrations (c, ) and 
specific brightness values (?,) 
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Here, z, »s »e Jk-lh moment of the relative spatial brightness profile B(r) 
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Qian and Elson used experimental values of the first three cumulants to determine unknown 
paramet rs f the sample. The number of cumulants which can be reliably determin d from 
experiment* is usually three to four. This sets a limit to the applicability f the mom nt 
analysis. 

In the pioneer publications on moment analysis, the idea that the count number distribution 
could be directly fitted was also discussed. The present chapter aims at presenting an adequate 
theory which has found a number of applications in biochemical assay development and drug 
screening." 

2. Photon count number distribution corresponding to a rectangular sample profile 

A key to a successful realization of the photon count number distribution analysis is an 
adequate calculation of the expected count number distribution function. Let us first consider 
a simple theoretical case when the light intensity reaching the detector from a particle as a 
function of coordinates of the particle is constant over the whole active volume of the sample, 
and zero outside it. Also, wc assume that the diffusion of a fluorescent particle is negligible 
during the counting interval 7". In this case, the distribution of the number of photon counts 
emitted by a single fluorescent species can be analytically expressed as double Poissonian: the 
distribution of the number of panicles of given species within this volume is Poissonian, and 
the conditional probability of the number of detected photons corresponding to a given 
number of particles is also Poissonian. The double Poissonian distribution has two 
parameters: the mean number of particles in the active sample volume, c and the mean 
number of photons emitted by a single particle per dwell time. qT. The distribution of the 
number of photon counts n corresponding to a single species is expressed as 



(6) 



where m runs over the number of molecules in the active volume. If P\{n) denotes the 
distribution of the number of photon counts from species i t then the resultant distribution P{ri) 
is expressed as 



This means that **(n) can be calculated as a convolution of £he series of distributions /*i(n). ■■. 
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3. Photon cbimt nuii^cf distribution corresponding to an arbitrary sample profile: the 
convolution technique 1 

■ ■ . 'V • ' ' • . ' 

Like in FCS, the rectangular sample profile is a theoretic*] model which can hardly be applied 
* in experiments. The issue which particular shape of the sample profile B{r) should be selected 
as a model Will be studied later in this chapter. Here we will only study how die photon count 
number distribution of a single species can be calculated for a given sample profile. We may 
divide the sample into a great number of volume elements and assume that within each of ~. 
theni the intensity of a molecule is constant. Contribution to photon count number 
distribution from a volume element is therefore double Poissonian with parameters cdV arid 
^YB(r). (Here q denotes count rate from a molecule in a selected standard position where , . 
B = l,and£(r)is the profile function of coordinates.) The overall distribution of the number 
of photon counts can be expressed as a convolution integral over double Poissonian 
distributions. Note that integration is a one-dimensional rather than a thiecHdimerisional 
problem here, because the result of integration docs not depend on actual positions of volume 
elements in respect to each other. Figuratively, we may rearrange the u>ieerdimensional array: 
of volume elements into a one-dimensional array, for example in the decreasing order of the 
value of B. 

In a number of first experiments in our laboratory, the photon count number distribution was 
indeed fitted, using the convolution technique. Our sample model consisted of twenty spatial 
.sections, each characterized by its volume V, and brightness b r However, a convenient and 
much faster computation technique exists as described in the next subsection. 



4. Photon count number distribution c rresponding to an arbitrary sample profile: tb 
technique of the generating function 

The forma] definition of the generating function of a distribution P(n) is as follows: 

In particular, if one selects { = e t? , then the distribution P{n) and its generating function 
G(q>) are interrelated by the Fourier transform. What makes the generating function attractive 
in count number distribution analysis is the additivity of its logarithm: logarithms of 
generating functions of photon count number distributions of independent sources, like 
different volume elements as well as different species, are simply added for the calculation of 
the generating function of the combined distribution. (This is so because the transformation 
(8) maps distribution convolutions into the products of the corresponding generating 
functions.) Applying the definition (8) to formula (6) with c -» cd V and q -> qB{r) .the 
contribution from a particular species and a selected volume element dV can be written as 

Therefore, the generating function of the total photon count number distribution can be 
expressed in a closed form 

C(^>-exp(ScJ( e «- , "" ( "-l)^]. (10) 

Numeric integration according to Eq. (9) followed by a fast Fourier transform is to our 
knowledge the motf effective means of calculating the theoretical distribution P(n) 
corresponding to a given sample (i.e., given concentrations and specific brightness values of 
fluorescent species). 






5. Sample profile models 



The most wideiy.used spatial profile modeJ in FCS is ihe ihrBc-dimcnsiooal Gaussian profile, 
with a single parameter of shape, the axial dimension ratio in longitudinal and radial ' - 
directions Logically, ihe fint step in photon count nurnber distribuUon analysis would be 
Tilling a count number distribution obtained for single species like xeU^thylrhodamine, sec 
Fig.l . Residuals curve with open circles illustrates the pc quality obtainable wiih the Gaussian 
profile. There are large systematic deviations in residuals. What is a sufficiently flexible ; 
model for ruling FCS data has turned out to be an absolutely inflexible and inadequate model 



for FID A. 



A model of the sample profile, which has yielded a better fit of me measured distribuuon Pfn; 
is GaussUh-Lor^^ open squares), but this model srill lacks.flexib^ 

that according to Hq, (8), a certain function of the Bpatial brightness B is integrated over the 
volume. In other words. ii is a relationship between B and V, characterizing a given spatial 
brightness profile in FID A. For example, the Gaussian profile yields the relationship 



dV r 
ax 



(11) 
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where we have denoted * = -In B. The Gauss-Lorentzian profile yields the relationship 



0.2) 



Both of the relationships are inflexible, ix.. they do not provide any spatial shape parameters 
to adjust the theoretically calculated distribution to fit the measured data. . .. 

When looking for sufficiently flexible models to fit experimental data, we ended at the ; 
following relationship: 



dx 



(13) 



There is a formal rather than a physical model behind Eq. (13). The fie quality obtainable with 
Eq. (13) is illustrated by the filled squares curve f Fig. 1. 
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Fig. 1 . A distribution of the number of photon counts measured at T = 40 ua for about iCr M 
solution of tetrarocthylrhodamine in water, data collection time 60 s. Three residuals curves 
are presented, corresponding to the best fit obtained with Eqs (1 1), (12), and (13). 






6/Distributkm ^ 

Some fluorescent species may have significantly wide distribution of specific brightness. For 
example vesicles, which arelikely to have a significandy broad size distribution and a rand m 
number of receptors, niay have trapped a random number of labeled ligand molecules. In order 
to fit count number distributions fox samples containing such kind of species, we have : 
modified Eq: (10) in the following manner. We assume that the distribution of brightness of 
particles q within a species is mathematically expressed as follows: . A 



Piq) « q * 



(14) 
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This expression has been selected for the sake of convenience: all moments of this distribuuon 
can be analytically calculated, using the following formula: 



J x e 



(15) 



It is straightforward to derive the modified generating function of a photon count number 
distribution. Following the arguments of section 2.3 one can rewrite Ecj. (9) as follows; 




C{0 = c*P&c t jdV\ dq/*q\a t .b t )(e 

■ ' * * 

where • 



(16) 



b' 

fXqio.b) " f(a) 
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The integral over g can be perf ormed. analytically: 



f V ; T-i 



(18) 



The parameters a, and 4; are related to the mean brightness 4 and the 



8 



width of (he brightness distribution a] by 



7. Weighting in FIDA 

In the range of obtained count numbers, the probability to obtain a particular count number 
usually varies by many orders of magnitude, see for example the distribution of Fig. 1. 
Consequently, the variance of the number of events with a given count number has a strong 
dependence on the count number. To determine weights for least squares fitting, let us assume 
for simplification that light intensities in all counting intervals are independent. Under this 
assumption, we have a problem of distributing M events over choices of different count 
numbers n, each particular outcome having a given probability of realization, P(n). 
Covariance matrix elements of the distribution can be expressed as follows: 

(6J>(n)6J>(m)} = — — . (20) 

where M is the number of counting intervals per experiment. 

For a further simplification, one may ignore the second term on the right side of Eq. (20)! 
which can be interpreted as a consequence of normalization (see Appendix). In this case, the 
weights simply equal to the inverse values of the diagonal eovariance matrix elements 



8. Data simulation algorithms 

Data simulation is a convenient means of testing FIDA algorithms and debugging the 
corresponding computer programs. As an illustration, in Fig. 2, a count number distribution is 
simulated for a case which models a binding reaction of a Labeled ligand to vesicles. 
Distribution with open circles corresponds to the free ligand alone; distribution with open 




^are* cordon* to labeled vesicles; and oistribuu n with filled squares corresponds to 
.Keir mixture. Concen^uons and specific brightness values have beet, se.ecced to model a 
realistic situation in drug screening. 
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of photon counts. This procedure is repeated M times to obtain a simulated count number 
distribution. 

9. Statistical errors of estimated parameters 

In general, a linear or linearized least squares fitting returns not only the values of the 
estimated parameters, but also their covariance matrix, provided the weights have been 
meaningfully set. It may turn out to be possible to express the statistical errors of the estimated 
parameters analytically in some simple cases (e.g. for the rectangular sample profile and 
single species) but in applications at least two-component analysis is usually of interest. 
Therefore, we have been satisfied wilh the numerical calculations of statistical errors, In 
addition to the ^theoretical" errors with the assumption of uncorrected measurements (Eq. 
(20)), in some cases we have determined statistical errors empirically, making a series of 
about a hundred FIDA experiments on identical conditions. As a rule, empirical errors are 
higher than theoretical ones by a factor of three to four. Empirical errors appear to be closer to 
the theoretical ones in scanning experiments. Therefore we axe convinced that the main reason 
of the underestimation of theoretical errors is the assumption of uncorrclatcd measurements. 

Figs. 3-8 illustrate how theoretical errors depend on the counting interval r. concentrations 
aid brightness values. Table 1 compares statistical errors of parameters estimated by fitting a 
photon count number distribution (FIDA) and by the moment analysis. FIDA is 
oveiwhelmingly better than the moment analysis if the number if estimated parameters is 
higher than three. 
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Fig: 3. Theoretical errors of the estimated parameters c and q of a solution of single species, 
depending on the value of q. The following values of experimental parameters were selected: 
c = 1.0; t 20 us; a y ~ -0.4; a 2 = 0.08;7> = 1.0 kHz; data collection time 2 s. 
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Fig 4. Theoretical errors of che estimated paremeiett c and q of a solution of single species, 
depending on the value of c. The following values of experimental parameters were selected; 
q = 60 kHz/particle; T= 20 >is; a, = -0.4; a z - 0.08; b = 1.0 kHz; data collection time 2 s. 
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hg, 5- Theoretical errors of the estimated parameters c and q of a solution of single species, 
depending on the value of T. The following values of experimental parameters were selected: 
c = 1.0; q - 30.0 kHz/particle (upper graphs); q - 60.0 kHz/particle (lower graphs); ai = -0.4; 
A3 = 0.08; b - 1.0 kHz; data collection time 2 s. 
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Fig. 6. Theoretical errors of the estimated parameters c and q of a mixture of two species, 
depending on the value of T. The following values of experimental parameters were selected 
c x = 0.1; c-2 = 2.0; q x - 200.0 kHz/particle; q 2 - 10.0 kHz/particle; a = -0.4; a x - 0.08; b = 1.0 
kHz; data collection time 1 0 s. Note that the optimal value of T for the determination of the 
parameters of the brighter species is lower than that of the darker species. 
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Fie 7 Theoretical errors of the estimated paraineten c and g of a n^oue of two species, 
deeding on the ratio of q 2 to q x . The following values of "P"***^ p^acrs were 
Sled: *, = 150.0 JcHz/particle; c, = o = 0.5; * = -0-4: 0, = 0.08; b = 1 .0 kHz; data 
collection time 40 s. ^ 
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following values of experimental parameters were selected; q } - 75.0 kHz/particle; q 2 - 25 0 
kHz/particle; a, = -0.4; a 2 = 0.08; b = i.O kHz; data collection time 60 s. Note that'an optimal 
concentration exists at about one particle per sample volume. This is generally true, except if 
leas than three parameters are to be determined. 
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40.0 



40.0 



40.0 



estimated 
parameters 



Parameter 
specification 



2L 



Value 
(<?s in 
kHz) 



3L 



Percent 
error of 
FIDA 



0.05 2.00 

150,0 1 .54 

3.0 J 0.53 
5.0 (fixed) 



0.05 
150.0 
3.0 
5.0 



2.26 

1,63 
3,18 
3.35 




Table 1. Statistical errors of estimated parameters from least squares fitting of photon count 
number distributions (FIDA) and from moment analysis. Error values are determined 
through processing a series of simulated distributions. 



Appendix 

Dispersion matrix (20) corresponds to the multinomial distribution of statistical realizations of 
histograms. We wiU show here that the Poissonian distribution, with the constraint that the 
local number of counting intervals M is fixed, will lead to the multinomial distribution. This 
is the rationale behind using Poissonian weights as given in Eq. (21). 

Let ^ be the expectation value of the number of events of counting k photons and let 

M a £*> be their sum. Let be a statistical realization with Af-J^ . Assume that 

realizations obey Poissonian statistics 



r 



[n^.m, ,...]! 



(AI) 






where wehav introduced the, notation /i 0 "V"- '". L - The probability of 

having the total of M events is ; ... ^ ; ^ 



N* • -- " • /V ' (A2) 



The conditional probability of-having . . events if there is a total of M events is 
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This u the multinomial distribution where we have introduced: 



. n 
Pt 9 M 



and C " are multinomial coefficients. 
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Generation of a random Foiwon number is Ihc following. For * given expected value of events E. a simulated 
number oi event* 5 is determined from a routinely generated nndem number R between 0.0 and 1.0 through 

inequation* £ Poisson{i\ £) < R £ £ Poisson{i\ £), 



r>4 
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